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Abstract 

Let H n be the hypercube {0,l} ri , and let H n ^ p denote the same 
graph with Bernoulli bond percolation with parameter p = n~ a . It 
is shown that at a = 1/2 there is a phase transition for the metric 
distortion between H n and H n>p . For a < 1/2, asymptotically there is 
a map from H n to H n ^ p with constant distortion (depending only on 
a). For a > 1/2 the distortion tends to infinity as a power of n. We 
indicate the similarity to the existence of a non-uniqueness phase in 
the context of infinite nonamenable graphs. 



1 Introduction 

The hypercube H n is the graph with vertex set {0, 1}" and edges between two 
vectors differing in a single coordinate. Bernoulli bond percolation on a graph 
G with parameter p is a distribution on subgraphs H C G where each edge 
is in H with probability p independently of all other edges. Site percolation 
is defined similarly except that each vertex is in H with probability p and 
all edges with endpoints in H are in H. H n)P is the graph resulting from 
bond Bernoulli percolation on H n . We view a graph G as a metric space on 
its vertex set V(G) with the induced graph metric, i.e. d(a,b) is the length 
of the shortest path from a to b. Our interest is in the effect of Bernoulli 
percolation on the geometry of the hypercube. 

In jH] Hastad, Leighton and Newman show that if p is taken to be con- 
stant, then for large n the distortion between H n and H np is at most a 
constant with probability tending to 1. If a graph is deemed to represent 
a computer network, then they were interested in the latency of embedding 
H n in Hn tP , which measures the loss of computational power resulting from 
eliminating a subset of the processors (vertices) or communication channels 
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(edges). The latency of an embedding between graphs is slightly stronger 
then metric distortion. 

In the classic paper j2] of Ajtai, Komlos and Szemeredi, bond percolation 
on the hypercube is analyzed. It is shown that if p = an -1 with a < 1, then all 
connected components of H n ^ have size polynomial in n (hence logarithmic in 
the size of H n ), while if a > 1 there is a single giant component of size linear 
in the size of H n , and of diameter of order n. When all connected components 
are small it is evident that there is no map with a small distortion between 
the cube and any of the components. However, when a giant component 
exists it is a priori possible that the distortion between the cube and the 
giant component is constant. Thus there is a gap between p = n^ 1 and p 
constant where it is possible that the distortion is small. 

To make things more precise, we use the following measure of the distor- 
tion between metric spaces. 

Definition 1. The distortion of a map f between metric spaces (X, dx), (Y, dy) 



Note that since we will deal with functions that are not necessarily 1 
to 1, the definition accommodates the possibility that two points a, b are 
mapped to the same point in Y. This does not constitute a significant change 
from other definitions. It should be noted though that this definition is not 
symmetric, as the functions do not need to have a full image. 

Two graphs (or metric spaces) with a small metric distortion between 
them have several similar properties. For more information on metric dis- 
tortion see [7j and reference there. Note that the distance between disjoint 
connected components of a graph is infinite, so only maps that preserve con- 
nected components are of interest. 

Note that (for bond percolation) a graph G and G p have the same ver- 
tex sets, so there is a natural map between the two. However, there is no 
guarantee (and generally it is not the case) that this map gives a minimal 
distortion or even gets close to doing that. However, in the case considered 
here, it will turn out that when the metric distortion is small, there is a map 
that approximates the identity map which gives a small distortion. 



is given by D(f) = D + (f)/D_(f) 




The metric distortion between X and Y , D(X,Y), is 



inf D(f). 
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Theorem 2. Let p = n a . 

1. If a < 1/2 then there is some constant c = c(a) such that 

F{D{H n ,H n>p ) <c) -> 1 

as n — > oo. 

2. If a > 1/2 i/ien t/iere zs some constant (3 = /3(a) > such that 

F(D(H n ,H n , p )<nP)^0 

as n — > oo . 

A similar theorem holds for site percolation, as well as for mixed perco- 
lation (with threshold at pbp s = n" 1 ). The essentially identical proof will be 
omitted. 

It follows from the definition of the latency of an embedding (see [6 ) that 
for p = n~ a with a < 1/2 the latency is polynomial in n. This follows from 
Theorem EJ as the latency is at most n D( -f\ It is plausible (though not proved 
here) that when a > 1/2 the latency is super-polynomial in n. 

The phase transition in this context is linked below to the p c < p u conjec- 
ture in context of infinite nonamenable graphs. In this light it is interesting 
to compare the behavior of the metric distortion of percolation in the hy- 
percube to the behavior in other natural graphs. For the n-box in Z d the 
distortion between the box and the giant component is O(logn). While not 
constant, this is small in relation to the diameter and there is no additional 
phase transition beyond the percolation threshold. For percolation on the 
complete graph, the metric distortion is given by the diameter of the giant 
component. On the hypercube the giant component has diameter 9{n) a.s. 
for any p > p = an' 1 with a > 1. 

Regarding the proof, the main idea for p = n~ a with a < 1/2, is that 
two vertices that are nearby in H are likely to stay nearby in H ntP . Thus the 
identity map is almost good enough, as it gives a constant distortion for most 
pairs of vertices. To find a map that gives a constant distortion requires a 
somewhat finer choice of the map, though the distance from x to f(x) is still 
bounded. 

When a > 1/2 the proof is based on showing that while the cube con- 
tains "a lot" of geodesic cycles, the giant component for a > 1/2 is locally 
very treelike and contains much fewer geodesic cycles and thus distortion is 
generated. 

In this regime there is a constant probability that two adjacent vertices 
that are both in the giant component are not joined by a short path in H UtP , 
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but will be at distance proportional to the diameter. Of course, since for 
the cube (and much more generally, [2| [H|) the giant component is known 
to be unique, the distance between the two vertices is finite. However, there 
is no longer a direct correlation between the metric of the cube and that 
of the giant component. As giant components are the finite graph analogue 
of infinite percolation clusters, this behavior can be seen as the finite graph 
analogue of non-uniqueness of infinite clusters. For graphs where p u — the 
threshold for uniqueness of the infinite cluster — is strictly greater than p c 
there is a regime where two nearby vertices have a positive probability of 
being in disjoint infinite components . That strict inequality was established 
in many cases of nonamenable infinite graphs, see e.g. [H] [3]. We believe 
that the distortion in this case is indeed linear in the diameter. However, 
since we have not eliminated possible maps that are far from preserving the 
geometry we only give a lower bound of rfi for some (3. 

Thus we propose that an analogue in finite graphs of having non-unique 
infinite components is that the metric in the giant component is very different 
from the original graph metric. In particular, with probability bounded away 
from 0, neighbors in the graphs are at distance of the order of the diameter 
in the giant component. For more on percolation on finite graphs see jHj- 

An additional aspect of the phase transition at a = 1/2 is the routing 
complexity. The routing problem is to find a path in H n p between two given 
vertices, preferably a short one. The routing problem may be defined under 
one of several models, the strictest of which involves starting at x and only 
being allowed to query edges incident on vertices that have already been 
reached from x. We refer to this as the local model. 

For p = n~ a with a < 1/2, it turns out that there is a polynomial time (in 
n) algorithm for routing in the local model, outputting a path of length 0(n) 
whenever x, y are in the same component. This is a consequence of the fact 
that distance between neighboring vertices is typically bounded, and so to get 
from x to a nearby y only a ball of constant radius (and polynomial volume) 
needs to be explored. On the other hand when a > 1/2 no polynomial 
algorithm exists. For more details on complexity of routing in the hypercube 
and other scenarios see [3]. 

We now proceed to present the proof of Theorem^! The proof is separated 
into the super-critical and the sub-critical cases, each requiring a different 
approach. 
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2 The Super-Critical Case: a < 1/2 



To illustrate the proof we first show a weaker result, namely that the typical 
distortion of the identity map is constant. 

Proposition 3. Let p = n~ a with a < 1/2. For some I = 1(a), with 
probability tending to 1 the percolation distance between two neighbors in H n 
is T most 2/ + 1. 

Sketch of Proof. We consider paths of length 21 + 1 between them making I 
steps in distinct coordinates, making a step in the coordinate that differs x 
from y, and then retracing the first I steps to reach y. The number of such 
paths is roughly n l and each is open with probability p 2l+1 . The expectation 
of X: the number of open paths is n( 1_2a ^~ a (l + o(l)), and for some / the 
exponent is positive. 

A pair of paths is unlikely to have a large intersection. The second mo- 
ment of X can be approximated by the square of the expectation. The largest 
error term comes from pairs of paths that intersect only in their first step, so 
that EX 2 = (EX) 2 (1 + n 2a - 1 + o(n 2a_1 )). The second moment method (see 
[l]) yields that 

(¥ X) 2 

F(X = 0) < 1 - = (1 + o(l))n 2 ^ -> 0. 

□ 

Similarly, the idea in the following proof is that there are many paths of 
some length between nearby vertices, and that one of them is very likely to 
be open. However, in order to get an exponential bound on the probability 
of failure we choose the set of paths we consider carefully. 

Fix I so that (1 — 2a)/ > 9a, which is possible as a < 1/2. Set m = |_f+2 J 
and fix throughout this section some partition of the coordinates into disjoint 
sets of size m denoted A, B, C\, . . . , C\ (with at least one coordinate left 
unused). A vertex in H np will be called good if it has at least 2m vertices at 
distance 2 (in H ntP ) differing only in A-coordinates. We first show that good 
vertices are dense: 

Lemma 4. Let p = n~ a for some a < 1/2. With probability tending to 1, 
every vertex v has is some good vertex u in H n p differing from v only in a 
single B-coordinate. 

Proof. First we show that the probability that a vertex is good tends to 1 
as n — ► oo, with a fixed. To this end we count open paths of length 2 using 
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A-coordinates starting at v. If there are more that 4m such paths, then they 
connect v to at least 2m. vertices and v is good. The probability that v is 
incident on at least m-p/2 = cin 1 ~ a open A-edges tends to 1 as n — > oo. On 
this event there are at least (m — l)cin 1 ~ a edges that extend these to paths 
of length 2. With probability tending to 1 at least fraction p/2 of these are 
open giving a total of at least c 2 n 2_2a open paths. Since for n large enough, 
c 2 n 2 ~ 2a > 4m, the probability that v is good tends to 1. 

Fix some v and let u\, . . . , u m be it's neighbors along B coordinates. Since 
the itj's all differ in their ^-coordinates, the events of the u/s being good 
are independent. For large enough n each Ui is good with probability at 
least 1 — 3~ l and then the probability that none of the itj's is good is at 
most 3- rnl < 2- n . Thus, a union bound over all vertices v shows that with 
probability tending to 1 every v has a good neighboring u. □ 

Proof of Theorem\^(l). We construct a map / that takes each vertex x to 
some good neighbor of x by changing a single B coordinate. By the above 
Lemma, with probability tending to 1 such a map exists. Since d(x, f(x)) = 
1, it follows that d(f(x),f(y)) > d(x,y) — 2 and so D-(f) > 1/3. To show 
that distances do not increase by more then some constant factor, it suffices 
to show that with probability tending to 1 the percolation distance between 
f(x) and f(y) for all neighboring x, y is bounded by 21 + 9. 

The distance between f(x) and f(y) is at most 3. First, fix some coor- 
dinate e where f(x) and f(y) differ. In the remainder there are some slight 
variations according as to which set e is part of. Since f(x) is good it has 2m 
vertices at percolation distance 2, differing only in the ^-coordinates. Let 
Xi, . . . ,x m be m of those that do not differ from f(x) in the e coordinate 
(If e G A there are at least m such neighbors; Otherwise all 2m are suit- 
able). Similarly define y\, . . . , y n . Note that Xi differs from in at most 7 
coordinates. 

We look for an open path from Xi to y% of length 21 + 9 for some i, 
and bound the probability that there is no such path. The second moment 
method is used to bound the probability of having no path for some i. The 
sets of paths we will consider are chosen so that distinct z's will be indepen- 
dent. 

To each i we associate a unique coordinate bi G B. If e G B we use 
instead of e the remaining coordinate not in any of the sets. For a sequence 
c = (ci, . . . , q) of coordinates with G we define the path -Pj(j) from Xi 
to yt as the path making I steps in the coordinates given by c, making a step 
in the coordinate bi, making steps in the coordinates where Xi and i/j differ, 
and then retracing the / + 1 steps given by % and c in reversed order, thereby 
arriving at i/j. If e G Ck for some k then we again replace it by the unused 
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coordinate. 

We now claim that paths -Pj(c) and Pj'(c') for % ^ %' are disjoint. Indeed, 
suppose that some v is in some Pi(c). According to the e coordinate of v we 
can say whether v is in the first or second half of the path. Assume w.log. it 
is the first. By the distance from f(x) we can determine the exact position 
of v in the path. Finally, either v differs from f(x) in the bi coordinate, and 
then we can say what i is, or it differs from f(x) in exactly 2 a coordinates 
and then we can say which is Xi. In any case, we can determine % from v. 

Let X be the number of open paths from Xi to z/i. 

EX = p 2l+9 m l = cn^- 2 ^ l - 9a , 

where c is a constant depending only on I. For I = 1(a) as specified above 
the the exponent is positive, and so the expected number of paths tends to 
infinity. The second moment of X is given by 

EX 2 = J2p 4l+18 ~ lPnP ' 1 
p,p' 

< (EX) 2 (j2 P~ 2km ~ k + m- l p- 21 - 9 

\ k 

< (EX) 2 (l + cn 2a - 1 ). 

And so the probability of having X > tends to 1 as n — > oo. In 
particular, for large enough n, the probability of having no open path for 
any of the i is less then (3~') m . In this case there is an open path of length 
21 + 13 between f(x) and f(y). Since the number of pairs x, y is n2 n <ti 3 lm , 
a union bound shows that asymptotically almost surely distances increase by 
a factor of no more than 21 + 13. □ 



3 The Sub-Critical Case: a > 1/2 

For two vertices x, y at some fixed distance from each other, the number of 
paths of length / between them grows like n 1 ^ 2 , and so the probability of 
having an open path tends to as n — > oo. This shows that there is no map 
with bounded d(x, f(x) with small distortion. However, the image of a map 
need not cover all pairs x, y, and there will be some pairs where paths will 
exist. 

A geodesic cycle in a graph is an isometric embedding of a cycle in the 
graph. The cube has a great number of geodesic cycles passing through any 
given vertex. These can be found by making any number of steps in distinct 
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coordinates, and then repeating the same sequence of coordinates to return 
to the starting point. 

A geodesic cycle (as well as a geodesic path) is a structure that is roughly 
preserved by a map with a small distortion. The key idea of the proof is 
to show that the percolated cube does not have many cycles, restricting the 
possible maps. 

Lemma 5. If a map from G to H has distortion D, and G has a geodesic 
cycle C of length 21 passing through a vertex v, then there is a simple (self 
avoiding) cycle C' C H of length V G [l/2D,Dl] so that f{v) is at distance 
at most 2D 2 from C' . 

The key idea is to look at a cycle in H that is the "image" of C, and 
remove any small loops it has to extract a simple cycle. 

Proof. Recall that the distortion is D = D+/D-. Suppose the vertices of 
C are t>o, i>i, . . . , v\ with v o = vi = v, and their images in H are X{ = f{vj). 
Since d{x^ x i+ i) < D + , there are simple paths of length at most D + between 
them, and these paths may be joined to form a (not necessarily simple) cycle 
Co in H of length at most D + l, passing through all the Xj's in order. The 
paths from Xi to x i+ i for various i's will be denoted as arcs. 

Next we extract from Co a simple cycle by a erasing loops it includes. 
Note that the procedure we use is not the standard procedure known as loop 
erasure. Suppose some vertex u appears twice in Co- Since u cannot be 
used twice in the same arc, it appears in distinct arcs. If u appears in the 
arc (xi,Xi+i) and in (xj,Xj+i) then there is a path of length at most 2D + 
between Xi and Xj. However, d(xi,Xj) > D_d(vi,Vj), and so d(vi,Vj) < 2D. 
Since the v^s form a geodesic cycle, breaking Co at the two occurrences of u 
results in one segment containing all but at most 2D of the x^s. 

Since the above holds for any 2 appearances of u in C , for each vertex 
u G Co there is a single segment of Co starting and ending at u, including all 
appearances of u, and including at most 2d of the x^s. We now repeatedly 
remove from Co the longest such segment corresponding to any repeated 
vertex. This choice guarantees that if a loop starting and ending at u is 
removed, then u itself will not be removed later. Since at each step we 
remove at most 2D of the x^s and create a vertex that cannot be removed 
at a later stage, eventually a simple cycle is left of length at least 1/2D. 

Finally, if xq = f(v) is not in the remaining cycle, it is in a loop that was 
removed from the cycle at some stage. Since the length of each such loop is 
at most 2DD + , it follows that f(v) is close to the cycle. □ 

Lemma 6. Let p = n~ a with a > 1/2, and 7 < 2a — 1. For large enough n, 
the probability that an vertex v is included in an open simple cycle in H U)V of 
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length 21 e [2n^, 2n 7 ] is at most n 1 + n ' 3 (i 3 + 1 2q 0. Furthermore, the probability 
that v is within distance 5 of such a cycle is at most n s + 1 + n ' 3 (f 3 + 1 - 2a ) _ 

Proof. First we estimate the number of simple cycles of length 21 in H n that 
include v. Since a cycle makes an even number of moves in each coordi- 
nate, the steps of a cycle of length 21 can be partitioned into I pairs of (non 
consecutive) moves in the same coordinate. The number of partitions of 21 
elements into pairs is (21 — 1)!!. The number of ways to choose a coordinate 
for each of the pairs is n . The probability that a cycle of length 21 is open 
is p 21 = n~ 2al , so the probability of v being in an open cycle of length 21 is 
at most (21 - ly.^n 1 - 20 ) 1 . 

This bound is decreasing in / as long as 21 < n 2a_1 , so when summing 
over for large enough n the first term is the largest: 

n< 
l=n?> 

Finally, the number of vertices within distance 6 of v is at most n 5 , and the 
second claim follows from the first. □ 

Proof of Theorem\B(2). Fix positive constants /3, 7 so that (3 + 7 < 2a — 1 
and 7 > 3/3. Assume that there is a map / : H n —>■ H n p with distortion at 
most n". Since vertices at distance greater than n 13 are mapped to distinct 
vertices, the range of / has size at least 2 n n~ nis . Since every vertex in H n is 
in a geodesic cycle of length n 7 , every vertex in the range is at distance at 
most 2n 2/3 from a simple cycle of length I e [n 1 " 13 , 2n 7+/3 ]. 

However, Lemma |H1 bounds the probability that a vertex has this property, 
and thus the expected number of such vertices. The probability of having at 
least 2 n n~ n0 vertices close to such cycles is at most 

on^2n 2 ' 9 +l+(7-/3+l-2a)nT-' 3 

f 71 _ n 2n 2 > 3 +n> 3 +l+(7-/3+l-2 a )n^-' 3 

2 n n~ nf3 

This tends to as n — > 00 since the dominant term in the exponent is n 7_/3 
with a negative coefficient. 

Thus the probability of having a map with distortion n 13 tends to 0. The 
above constraints allow taking (3 = (2a — l)/4 — e . □ 

4 Problems 

• What is the critical window? What happens if p = an~ x l 2 l Is there 
some a c so that above a c the distortion is constant or perhaps logarith- 
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mic in n while below it the distortion is logarithmic in n or even larger? 
What do the above proofs yield in this case? 

• When a > 1/2, what is the true behavior of the distortion? Since the 
diameter of H n is n the distortion can not be larger, but the proof only 
gives a lower bound of n 13 for any (3 < (2a — l)/4. 

• When a < 1/2, what is the true distortion? The proof of Theorem El 
gives a bound D + = 0((l/2 — a) -1 ). The proof of Proposition |3] 
suggests a bound of D + = 1 + [ 1 ^_ a j . We believe this is the correct 
uniform bound on the distortion. 

• What is the minimal latency of an embedding of H n in H np when a < 
1/2? Show that it is super-polynomial in n. Does it grow exponentially 
in n? 

• Let G be a vertex transitive graph. Perform p-bond percolation on 
G. Assume that the median distance between neighbors in G, after 
percolation is k, show that the metric distortion between G and the 
percolation giant component is at least k. 

• In particular, assume a vertex transitive graph G has girth g, is it true 
that the distortion between G and a percolation subgraph of G, with 
p bounded away from 1, is a.s. at least gl 

Acknowledgments. Thanks to Eran Ofek and Udi Wieder for useful dis- 
cussions. 
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